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Abstract 

Decoding of convolutional codes poses a significant challenge for coding theory. 
Classical methods, based on e.g. Viterbi decoding, suffer from being computation- 
ally expensive and are restricted therefore to codes of small complexity. Based on 
analogies with model predictive optimal control, we propose a new iterative method 
for convolutional decoding that is cheaper to implement than established algorithms, 
while still offering significant error correction capabilities. The algorithm is partic- 
ularly well-suited for decoding special types of convolutional codes, such as e.g. 
doubly cyclic convolutional codes. 
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1 Introduction 

A central aim of coding theory is to protect transmitted or stored information 
against errors. A common technique is to split the sequence of information 
symbols into blocks of constant length and map each block injectively to a 
codeword of larger length. This map is called the encoding map and its image 
space is referred to as a block code. To protect the process from transmission 
errors requires a procedure that enables one to recover the sent message from 
the received one by projecting it back to the code. This is called decoding and 
forms the basis of error correction algorithms of current codes. Decoding is 
an inherently difficult task, but for block codes effective decoding algorithms 
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are available that depend on the special algebraic structure of the codes; e.g. 
BCH codes, Reed-Solomon codes, or list decoding techniques, see e.g. [7116] . 



In the sequel, we will consider block codes only as an intermediate step for 
decoding convolutional codes. Such convolutional codes are a natural gener- 
alization of block codes and have found widespread applications; see [TC)][T2"] . 
Algebraically, they are defined as submodules of F[z] n , spanned by the columns 
of a full rank rectangular polynomial matrix G(z) G F nxfc [z]. In contrast to 
block codes, where a rather rich theory is available, there is no really effi- 
cient decoding algorithm known for convolutional codes. Classical decoding 
algorithms for convolutional codes such as e.g. Viterbi decoding [6] work well 
only for codes of moderate dimensions. There is thus considerable interest in 
developing efficient decoding algorithms for convolutional codes that improve 
known algorithms, such as Viterbi decoding. 

Since convolutional codes can be interpreted as linear control systems defined 
over a finite field F, see the survey paper [T3] and the references therein, it is 
possible to apply the rich tools from linear systems theory and optimal control 
to study such codes. This approach offers a better understanding of convolu- 
tional codes and led already to new construction methods and algorithms; see 
[71l2l^lfT^lfT5] . From a systems theory point of view, decoding of convolutional 
codes can be interpreted in at least two different ways [IB] . One interpretation 
treats decoding as a tracking problem, where the decoder attempts to track 
the received message by the most probable codeword sent. Another perspec- 
tive lies in treating it as a filtering problem, where the decoder is requested 
to filter the noise introduced by the channel. 

In this work we will follow the first approach, by treating the decoding prob- 
lem of convolutional codes as a tracking exercise for linear systems to replace 
the received message by its closest polynomial codeword [IB]. Motivated by 
analogies with model predictive control, we propose a simple receding hori- 
zon algorithm for convolutional decoding, that combines an arbitrary block 
decoding algorithm with a few receding horizon steps. The proposed method 
has the advantage of being computationally cheaper than well-known Viterbi 
decoding, while still achieving substantial error correction. In a companion 
paper [BJ we show that our algorithm can lead to competitive decoding results 
for the class of doubly-cyclic convolutional codes. 

The paper is structured as follows. Section 2 introduces basic terminology from 
coding theory and outlines a decoding procedure for convolutional codes via 
the Bellman optimality principle. In Section 3, the new algorithm is proposed 
and error correction capabilities of the method are established. Examples are 
discussed in section 4 to illustrate the decoding properties of the algorithm. 
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2 Dynamic programming approach to convolutional decoding 



2.1 Coding theory terminology 



We begin with a brief summary of basic notions from coding theory. For further 
information we refer to standard textbooks as, e.g., 09]. Throughout this 
paper, F denotes a finite field with q elements. Given any full column rank 
matrix G G F nxfe , a linear block code of length n and dimension k is a k- 
dimensional vector space 

C := {Gv\v G ¥ k } C F n 

and G is called a generator matrix of C. If, possibly after a permutation of the 
rows, G G ]p nxfc i s of the form 



G 



(a 



h 



then G is called a systematic generator matrix. The check matrix of the code 
then is defined as 

S = (l n - k -A 



and, since C = KerS, it provides a kernel description of the code. A natural 
metric on a code is defined by the Hamming distance. The Hamming distance 
of vectors x, y G F n is defined by d(x, y) := w(x — y), where 

w(c) := #{i| Ci ± 0}. 

denotes the Hamming weight. The minimum distance of a code C then is the 
minimum among the distances between any pair of codewords, i.e. 

d(C) := min w(c). 

cec\{o} 

It measures the error correcting capabilities for the code. In received 
vector x G F n does not belong to C, i.e. if some error has occurred in the 
encoding process, a natural way to recover the sent message is by taking the 
maximum likelihood estimate of x, i.e. to replace x by the closest vector from 
C. This process is called decoding. A decoder is a map it : F" ' — > C that is the 
identity on C. A special case in point are maximum likelihood decoders that 
are defined by the set valued map 

7Tc(x) := argmind(x, c) C C. 
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In order to achieve the best error correction properties of decoding, the vectors 
from C should be as far apart as possible. Thus one is searching for codes whose 
minimal distance is a large as possible. The maximum number of errors that a 
code can correct is known as the error correction capacity of the code, which 
is half the minimum distance of the code minus one. If a code of length n 
has minimum distance d = d(C), then the spheres centered at the codewords 
with radius \_^-\ are disjoint. Any vector from F™ contained in one of these 
spheres can thus be decoded to its unique nearest codeword, the center of 
the sphere. But the union of these spheres may not cover all vectors of the 
ambient space. The covering radius pc of a code C is the maximum distance 
from any vector of F n to its nearest codeword. The covering radius of a code 
is the smallest radius needed for spheres centered at the codewords to cover 
the whole ambient space. 

2.2 Convolutional codes and Bellman principle 

In this paper we follow the well-established approach, see e.g. [13], that re- 
gards a convolutional code C of length n, dimension k and complexity 5 as a 
submodule 



defined by a full column rank polynomial matrix G(z) E F nxfc [2:], k < n. 
Here the complexity 5 of C, also referred to as the "constraint length" of the 
code [TTj . is defined as the maximal degree of all k x k minors of G(z). 

After a suitable permutation of the rows, we can assume that the generator 
matrix is of the form 



with right coprime polynomial factors P(z) E Y ( > n ~ k * >xk and Q(z) E W kxk , 
respectively. Here 5 = degdetQ(z) is assumed to be the maximal degree of all 
kxk minors of G. Therefore the transfer function P(z)Q(z)~ 1 E ¥^ n ~ k ' )xk (z) is 
proper rational of McMillan degree S and thus has a minimal (i.e. controllable 
and observable) state space realization 

x t+ i = Ax t + Bu t , x = 
yt = Cx t + Du u 

A E ¥ SxS , B E ¥ Sxk , C E F(™- fc ) x<5 , D E Y^ n ^ xk , x E¥ 5 ,u E ¥ k y E ¥ n ~ k . 
Conversely, given any such linear linear systems representation over the field F, 
then a right coprime factorization of the transfer function C(zls— A)~ 1 B+D = 
P(z)Q(z)' 1 defines a convolutional code with generator matrix flTJ. Thus the 



C := {G(z)v(z)\v(z) E ¥ k [z]} C ¥ n [z] 




(1) 
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complexity of the code corresponds to the McMillan degree of the associated 
rational transfer function, i.e. to the dimension of the state space of the asso- 
ciated controllable and observable linear system. 

In the above framework, codewords of a convolutional code correspond to poly- 
nomials c(z) = °i z% £ C with a finite number of coefficients cq, ■ ■ ■ , c 7 £ F n . 
We extend the Hamming distance on F n to a metric on vector polynomi- 
als ¥ n [z] via dist(c(z), c(z)) := d(ci, c 0- Given any polynomial c(z) = 
J2j=o Ci z% £ ¥ n [z] the task of minimal distance decoding then asks to find a 
code word c opt £ C that minimizes the distance to c, i.e. 

c op t := argmindist(c, c) 



Such an optimal codeword always exists, since F n is finite, but need not be 
unique. The map c i— > c opt is called a minimal distance decoder for C. The 
method of dynamic programming yields a way to calculate this optimal code 
word c opt for each specific choice of c. 

To see how this works we first give a linear systems interpretation of the code 
words of C. This requires that ([2]) is observable. Rosenthal [13] has shown that 
a polynomial c(z) = J2l=o c % z% is a code word of C if and only if and only if 



/ A~<B A^~ l B . . . 
D 

CB D 

-I CAB CB D 



B \ 



\ 



CA^ l B 



Hi 



CB D) yujj 



0, 



(3) 



where I denotes the (7 + l)(n — k) x (7 + l)(n — k)- identity matrix and 



Ct 



Vt 



£ F (n " fc) x F fc , t = 0, 



7- 



The input sequences u , ■ ■ ■ , « 7 in (j2J) that define an admissible code word 
c are therefore just those controls that steer the initial condition xq = in 
finite time back to xq = 0. It is easily seen that there exist always a nontrivial 
choice of such input sequences. Moreover, the minimal time to steer back to 
x = via a non-zero input is at least 7 > K min ; n m i n < |_f J being the smallest 
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controllability index of <$fy. Let U = proj(C) denote the submodule of ¥ k [z], 
obtained by projection of C onto F fc [z]. By coprimeness of P, Q it follows that 

U := Q(z)¥ k [z] = {Q(z)u(z) e ¥ k [z] \u(z) e ¥ k [z] }. 

Moreover, expressed in terms of controllable and observable realizations (A, B, C, D) 
of P(z)Q(z)~ l , we obtain the state space description 

U = {u(z) = jr Ul z l G ¥ k [z] | £ A l B Ul = 0, 7 e N }. 

i=0 i=0 

In linear systems theory, U is therefore referred to as the module of zero return. 

Now assume, that we want to decode a polynomial c G ¥ n [z] via ([2]) with 
coefficients 

c t = ^1 G F (n " fc) x F fe , t = 0, • ■ • , T 
\ut) 

By the above, this means to solve the optimal control problem of finding 
an admissible input function u G U that minimizes the tracking error cost 
functional 

oo 

J(x ,u) := dist(c,c) = J2 k t(x t ,u t ) 

t=0 

where 

k t (x,u) := w(u - u t ) + w(Cx - y t + Du), t G N . (4) 

Note that the above series is always finite, as the inputs u(z) are constrained 
to be admissible polynomials. Thus this is an optimal control problem where 
the classical Z 2 -distance from linear quadratic controller design is replaced by 
the Hamming distance. Let 

Vooixo) = inf J(x , u) (5) 

uGA 

denote the optimal value function. The optimal control can then be computed 
via the Bellman principle, although this is a bit complicated here due to the 
varying length of the inputs. Thus we do not pursue this approach here. A sim- 
plified analysis can be given under the assumption that all data are available 
over a fixed horizon T. Thus assume, we consider the task of minimizing 

T 

J(x ,u(-),T) = Y^h(xt,Ut) 

t=0 

with kt(x,u) as in (TjJ and we optimize over all input sequences ito, ■ • ■ , ut- 
For JV = 0, ■■■ ,Tlet 

T 

V N (x) = min ^ k t (x t , u t ) 

t—N 
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denote the N-th. value function, where minimization occurs over all 

sequences u^, ■ ■ ■ ,uj<. By the Bellman principle, these functions satisfy the 
functional equation 



Applied this approach to the negative log-likelihood function of the channel 
output, this then becomes exactly the Viterbi decoding algorithm [11]. How- 
ever, the computation of the value functions is still computationally expensive 
and success of this method is therefore restricted to codes of small complexity. 
In the next section we will give a somewhat easier model predictive control 
approach. 



3 Convolutional decoding via receding horizon 

We now show how a modification of the receding horizon method from optimal 
control leads to an effective decoding algorithm for convolutional codes. The 
solution is achieved by means of a slight modification of the classical receding 
horizon method were L, instead of 1, vectors are taken to update the solution. 
Moreover, we carry out the main minimization step by a suitable decoding 
algorithm for a certain block code associated to the system. Throughout this 
section we assume that ([2]) is controllable and observable. This is actually a 
natural and frequently used assumption [T3] . 

We assume the following data are given: 

(1) A finite sequence of received, to be decoded, words 




x t+1 = Ax t + Bu t (x t ), £ = (),•■■ , 



T-1. 




for arbitrary T 6 N . 
(2) Positive integers L < N < T. 



We then attempt to iteratively minimize the finite cost function 



T-1 



J(x Q ,U,T) = J2( W (yi ~ Vi) + W ( U i - U i)) 



(6) 



i=0 
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over the set of admissible inputs u G U. For this we note, by controllability 
of that any unconstrained minimum u* of the cost function ([6]) can be 
extended to an admissible input ii a dm £ W by steering the terminal state xt 
to zero. Of course, this admissible solution it adm will not necessarily be an 
optimal solution of (jUJ). 

The iteration steps to be followed at every time instant t G No are 

(1) Consider the initial (known) state clS Xf . 

(2) Solve an iV-step finite horizon tracking problem, i.e., find the uncon- 
strained input sequence {ut+i}^ 1 which minimizes 

N-l 

J(x t ,u, N) = [w(yt+i - m+i) + w(ut+i - ut+i)] 

i=0 

(3) Update the solution input with {u t , . . . , u t +L-i} and use it to update the 
solution output with {y t , . . . , y t+L _i} and to calculate x t +L- 

(4) Update the time instant t with t + L until t = T. 

The last step of the algorithm then results in an input sequence u , ■ ■ ■ , «t-i- 
By controllability of (A,B), we can extend this sequence to an admissible 
input sequence uq, ■ ■ ■ , ut, «t+i, ■ • • , ut+t, by steering the final state %t into 
xt+t = 0. Here, r < K max , with K max the largest controllability index of (A, B). 



The obtained input sequence w , 
and 



, u T+T with associated outputs y , ■ ■ ■ , y?-\ 
GF M) xF 1 ', t = 0,---,T + r, 



then defines a codeword c(z) = Y,J=o CiZ 1 E C that serves as a decoding for c. 
We emphasize again, that due to the unconstrained minimization, this is not 
an optimal solution to the above tracking problem. 

Step 2 represents the main problem to be solved, which we now replace by a 
general block decoding step. By inspection, J(x t ,c,N) = itf(6v)> f° r a vector 
Cjv = z t,N + B N u tyN G ¥ Nn defined by 



/CA N - 1 \ 



Zt,N 



C 




( Vt+N-l\ 

m 



(D CB CAB 
D CB 



B 



N 



ca n ~ 2 b\ 

CA N ~ 3 B 

CB 
D 



iNk 



Ut,N 



/«i+JV-l\ 
Ut+N-2 
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Thus, each minimization step of length iV can be solved by decoding the vector 
Zt t N with respect to the block code Cn generated by B N . At this point we make 
contact with coding theory. Consider the block code 

C N = {B N u | u G ¥ Nk } C ¥ Nn 

with generator matrix Bn- The maximum likelihood decoder of Cn then is 

ttc n (zt.N) := argmind(z tjN ,v). 

vGCn 

In particular, if we decompose any vector v G 7Tc N (z tj N) as v = G F n ~ fc xF fc , 
then the optimal vector u is equal to — v<i. 

By replacing step (2) by a suitable decoding algorithm for Cn, we obtain 
the following algorithm where decoding of Cn is represented by the function 
Decoding. Note that, for ease of notation, we have written the column vectors 
w, z — e, u in row vector form. 



Receding Horizon Decoding Algorithm 
Input : 

* x o> {ijt}o> A, B, C, D (*they describe the system*), 

* iV Olength of finite horizon*) 

U = Empty_List (*the solution sequence*) 

t = 

Wile Exit=N0 

w = (CA N ~ l x t - yt+N-i, • • • , Cx t -y t ,0,...,0) 
e =Decoding(i?7v, w) 

( C AD ■ ■ ■ ) c 'ii C JV) • • • > c i) = z — e 
u = -(cf ) ...,c£) 
U = Append u to U 

x t+L = A L x t - Ef = i A L ~ l Bdl 
t = t + L 

If t>T, Exit=YES 
End-While 

Extend U to an admissible input sequence 
Output: U 



The whole algorithm takes at most many steps. Parameter L is directly 
related with the number of Uj that can be correctly decoded at each iV-step 
problem. The precise relationship depends on the code Cn and it is given in 
Theorem 13.21 Moreover, in each step the decoding problem may have more 
than one solution. We address this issue in the final section of the paper. 
Although convergence to the optimal solution of the tracking problem may 
not occur, we can at least derive an upper bound for the achieved cost. 
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Proposition 3.1 Let pn be the covering radius of the code Cn and let u denote 
any input sequence that is produced by the algorithm. Then 



J(x ,u,T) < 



Pn 



Proof. For any vector z G ¥ Nn , there exists a codeword at distance less or 
equal to pn- Thus the cost added to the functional at every iteration step 
t = 0, L, 2L, ■ ■ ■ , is upper bounded by 

min{w(z t! N + Bnu)} = w(e) < pn- 

Therefore, the total cost obtained after steps is upper bounded by |"^]pat. 
□ 



The number L of steps to update can be chosen in dependence on the cor- 
rection properties of the code Cn- Recall the notion of decoding error, which 
occurs when the decoding algorithm outputs a codeword different to the orig- 
inal one. A case in point here is where in the iterative method the solution is 
not updated with the entire codeword from Cn but with the components cor- 
responding to u t , . . . , u t+ L~i- Those decoding errors which do not affect these 
components will be called admissible decoding errors. The precise connection 
between L and N is given by the following theorem. 

Theorem 3.2 Let dw be the minimum distance of the code Cn- The decoding 
scheme can correct |_^J errors, d' > d]y — 1, up to an admissible decoding 
error, if and only if in every codeword from c G Cn of weight w(c) < d' all the 
components C(jv-L)(n-fc)+i, • • • , c N ( n - k ) and c Nn - Lk+ i, . . . , c Nn (i.e., those that 
don't admit a decoding error) are zero. 

Proof. Note that the generator matrix P>n is systematic, and the check matrix 
of the code is well known to be 



/ 



H 



N 



D CB CAB 
D CB 



-Id 



N(n-k) 



\ 











CA N ~ 2 B 
CA N - 3 B 

CB 
D 



J 



Note that Hn corresponds to the kernel representation matrix Q, up to a 
change of ordering of time indices to decreasing order and removing the first 
block row ([3]), that corresponds to the zero return condition. 

The minimum distance e^v of the code Cn is precisely the minimum number 
of linearly dependent columns of H N , [I], as the coefficients of one such linear 
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dependency would be the components of a codeword from Cn. This bounds 
the number of errors that can be corrected in an iV-step. Note, however, 
that after every iV-step decoding, the method updates the partial solution 
just with u t , . . . ,u t+ L-x, i. e., decoding errors that occur in the components 
corresponding to u t +L, ■ ■ ■ ,u t +N (and hence also in those corresponding to 
yt+L, ■ ■ ■ , 2/t+jv) are admissible. The set of components corresponding to these 
vectors that allow errors is a = {1, ... , (N — L)(n — k), N(n — k) + l, . . . , Nn — 
Lk}. Let us denote its complementary by a. 

An admissible error, a vector with support in a, is also a codeword: an ad- 
missible error is the difference between the codeword sent and the codeword 
wrongly decoded, and by linearity the difference of two codewords is also a 
codeword. 

Let us assume that the code doesn't allow to correct error vectors of weight 
t! = L^-J up to an admissible decoding error. Then, there exists a vector v 
such that for two different codewords c, d G Cb n it can be written as v = c + e 
and v = d + e' with w(e), w(d) < t! . Since decoding up to an admissible error 
is not possible, we have that Cg, 7^ d a , i. e., e„ 7^ e' a . Then, c + e = d + d 
and by linearity c — d = d — e = c" is a codeword from with weight 
w(d') < w(e) + w(d) < It' < d! and such that 7^ 0, which contradicts the 
assumption of the theorem. The inverse implication is immediate. □ 

As a consequence, the decoding property of our algorithm is as follows. 

Corollary 3.3 Let L, N , Cm and d! be as in Theorem \3.S[ Then: 

(1) The output of the algorithm is a codeword from the convolutional code. 

(2) If in every subsequence 

(CjL, CjL+l, • • • , CjL+N-l), j > 0, 

of the received sequence the Hamming weight of the error is at most d' , 
then the algorithm recovers the original convolutional codeword. 

Proof. (1) Every subsequence (cjl, CjL+i, ■ ■ ■ , cu+\)l-i) that is generated by 
the algorithm at each step is in the kernel of H^. Moreover, considering the 
way in which x t+ L is updated in the algorithm, it follows that the sequence 
{ c t}J=o is i n the right kernel of the submatrix obtained by removing the first 
block row in ([3]). The last step of the algorithm consists in extending the 
sequence so that the last state becomes zero. Therefore the output of the 
algorithm is also in the kernel of the first block row of ([3]) and is therefore a 
codeword. 

(2) is a direct consequence of Theorem 13.21 □ 
Example 3.4 Let us consider the convolutional code over¥ 5 generated by the 
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matrix 



1 A + z 
3 z 
1 



P(z) 



which as we have seen before can be regarded as the linear system described by 
the equations 

x t+ i = (1,2) u t 

y t = Ax t + (1,3) ut , 

x = 

i. e., it has a minimal realization (A, B, C, D) = ((0), (1, 2), (4), (1, 3)). 

Let us fix N = 2. Then our algorithm will work with the received vectors v t , 
Vt-i at each time instant t, when a vector must be decoded with respect to the 
code that has as check matrix 



H 



N 



10 13 4 3 
10 13, 



where the columns 1, 3, 4 correspond to the coordinates ofvt- We observe that 
although the minimum distance of the the code is 2, there is no codeword of 
weight < 2 with support in the positions 2, 5, 6. Then if we fix L = 1 we 
can allow errors in coordinates 1, 3, 4 (vt-i will be correctly decoded) and in 
exchange be able to correct one error. In this way our scheme will produce the 
correct Vt-i in each decoding step. 

The convolutional code has parameters [n, k, 5, df ree ] = [3, 2, 1, 3], and in par- 
ticular it allows the correction of one error. Hence our algorithm takes full 
advantage of the error correcting capacities of the code. 



4 Appendix: Uniqueness of decoded sequences 



An important aspect of decoding is whether it has one or more solutions. Since 
our algorithm works sequentially we are interested to know whether there is 
only one list of vectors u t , . . . , u t+L _i in each block decoding step. Note that 
since the matrix has maximum rank, then Ut, . . . , Ut+L-i are uniquely 
determined by a given block codeword at each step of the algorithm. Hence 
the question reduces to the one of how many convolutional code words are 
closest to a received message. 



12 



Example 4.1 Let us consider over ¥2 the finite time tracking problem prob- 
lem with N = L = 1 defined by the matrices 



A 




D 




C 




D 




Assume that the problem is solved up to a time instant t and let xt = (1,0), 
u t+ i = (0, 0) and y t +i = (0, 0). We need to decode the vector = (0,1, 0, 0) T 
with respect to the code generated by the matrix 



B 1 



However both the vectors (0,0, 0,0) T and (0,1,1, 0) T belong to the code and 
are at a Hamming distance of 1 from the vector that we want to decode. Hence 
both the corresponding values for u t , (0,0) and (1,0), would be equally valid. 

We consider then a problem with window length N = 2. Let Ut+2 = (0,0), 
y~t+2 = (1 , 0) . Then we have to decode the vector (1, 1, 0, 1, 0, 0, 0, 0) T with 
respect to the code generated by the matrix 



Bo 





(° 


1 


1 


i\ 


D CB^ 


1 


1 


1 
















1 


D 








1 


1 


I 


1 
















1 








1) 








1 







V° 








V 



The (unique) closest codeword to that vector is (1, 1, 0, 1, 0, 0,1, 0) T and hence 
there is a unique solution which yields u t = (1,0). 

To study the probabilities of multiple solutions we count the number of vectors 
that can be uniquely decoded, which are those inside the largest disjoint balls, 
centered at the codewords. 



Consider a code C of length n, dimension k and minimum distance d defined 
over the field F with q elements. Its error correction capacity is t = L^pJ • Each 

ball of radius t contains exactly y~] [ . Uq — l) 1 vectors, and there are q k of such 

i=0 Vv 

balls (as many as codewords). Then, the ratio of the number of uniquely de- 
codable vectors with respect to the cardinality of the whole ambient space F n , 
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known as the density of C, is 



Sr. 



qn—k 



The probability that a randomly chosen vector is out of all these balls is 
PS = 1- Sc- 

Before addressing the next result, and for the sake of simplicity, we fix the 
following notation 



Ek,t 



s(?) ( «- i: 



Theorem 4.2 Given a finite horizon tracking problem of window length N , 
the probability that there are M different solutions which differ in A consecu- 
tive vectors can be upper-bounded by 

&k,t i=N 

Proof. To have M different solutions that differ in A consecutive vectors 
means that at some step of the algorithm all finite horizon tracking prob- 
lems with window length < A have more than one solution, and that they 
cannot be discriminated with a larger window, i. e., all solutions agree in the 
last input vector u t+ &. 



On each tracking problem with window lengths N < I < A the vector to be 
decoded is further than the error correction capacity of the code Q, and as 
seen before, the probability of this to happen is Pg . Since this is the case for 
all I — N, . . . , A, the probability that all finite horizon tracking problems with 
window length < A have more than one solution can be upper-bounded by 

P°c N ---- p L=f{ p0 c l ■ (7) 

i=N 



Let us study now the probability that M different optimal solutions of a finite 
horizon tracking problem of length A have the same solution vector u t+ &. 
Let w t)N = (w lt A, Wi,a-i> ■■■■> w i,u w 2,Ai w 2,A-u ■ ■ ■ > w 2,i) be the vector to be 
decoded. For each of the M solutions, {u J t+i }fLo with j < M and u] +A = . . . = 

M i+A' ^ 

A 

i=l 
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Note that the fact that the M solutions have the same Ut+A is equivalent to 
the M vectors u- 7 = (w{ a ,io 2; a) are decoded to the same codeword of the 
block code C\ and all of them have error of the same weight (since all M 
solutions up to instant A — 1 are also optimal, otherwise some would have 
been discarded for smaller window lengths, and hence contribute the same to 
the cost functional). 

Then, let c e Cb 1 be the codeword to which v 1 is decoded, c = {c\, c 2 ) according 
to the splitting of the vK We have d(c,v l ) = e < t = L^rJ- The probability 
that for all j — 2, ... , M, is also decoded to c and d(c, v^) = e depends on: 

• the probability that there are a error components in the part u>2,a (which 
is a common part of length k for all i> J ) 

P [d c 2 , w 2 ,a) = a)= = - := P ( 

#{v\d{c 2 ,v) < t} V( J(5-l) 

r=0 



r 



r 



the probability that d(c, v 1 ) = e = a+/3 (e < t) provided that d(c 2 , w 2j a) = a 

C7)(?-i) /3 



P(d(c, v 1 ) = a + P\ d(c 2 , w 2A ) = a)= .\ p ' - . := P ( 



e c,a 



=0 



•S 



• the probability that for each j = 2, . . . , M d(c, v^) = e provided that 
d(c 2 , w 2: a) = a and d(c, v 1 ) = e 

P(d(c, v J ) = e\ d(c 2 , w 2 ,a) = a, d(c, v 1 ) = e) = 13 J ^_ k := P^| c>a , e 

Considering all the possibilities for values of a and (3 (and hence of e) we have 
that the probability that all vectors v J are decoded to the same codeword of 
Ci and that their errors have the same weight is 

t t-a M 

^ t ^ ] Pc,a ' Pe\c,a ' J_ J_ Pyi \c,a,e 
a=0 f3=0 j=2 

5Q.- r g(C--)..-< , (gf;')..--)" « 

q( M-i)( n -k)^r ( \(q-iy 



r=0 



, r 
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Taking into account that 



t-a 



/ n- k\ , _ „\ M (L-^/n-k 



M 



©(?-i) Q <(©(9-i: 



a\M-l 



u<>-*r<&">-*r*&<>-^>-*" 



?oC) (9 " i)Q ^C^ fc ) (g " i)/3 "^0 (9_i) ' (equaiity ^ k > n ~ k - t) 

then ([H]) is upper-bounded by 



13 ' < ^ . (9) 



M-l 
Ci 



Thus, the probability that there are M different solutions which differ in A 
consecutive vectors is upper-bounded by the product of (J7]) and 



rM-l A 

-j^n^- (io) 

&k,t i=N 



□ 



5 Conclusion 



We develop a system theoretic approach towards convolutional decoding, fol- 
lowing the well-known interpretation of convolutional codes as linear systems. 
The Bellman optimality principle, applied to optimizing the Hamming dis- 
tance function for linear systems over finite fields, then yields an optimal 
control decoding algorithm that is closely related to the Viterbi algorithm. 

To obtain an algorithm with lower computational cost, we propose a model 
predictive control algorithm, using a receding horizon iteration. This new al- 
gorithm has good decoding properties, as it yields desired codeword as long 



1(3 



as there are not too many errors on N consecutive received vectors. We also 
estimate the probability that the algorithm computes a unique solution. 
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